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1 4.2BSD and 4.3BSD as examples of the UNIX system 
John S, Quarterman, Abraham Silberschatz, James L Peterson 
December 1985 ACM Computing Surveys (CSUR), volume 17 issue 4 
Publisher: ACM Press 

Full text available: 1 B pdf(4.07 MB) Additional Information: full citation, abstract, references, citings, index 
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This paper presents an in-depth examination of the 4.2 Berkeley Software Distribution, 
Virtual VAX-11 Version (4.2BSD), which is a version of the UNIX Time-Sharing System. 
There are notes throughout on 4.3BSD, the forthcoming system from the University of 
California at Berkeley. We trace the historical development of the UNIX system from its 
conception in 1969 until today, and describe the design principles that have guided this 
development. We then present the internal data structures and ... 

2 pHluid: the desi g n of a parallel functional language implementation on workstations 

Cormac Flanagan, Rishiyur S. Nikhil - , ; ; J / 

June 1996 ACM SIGPLAN Notices , Proceedings of the first ACM SIGPLAN 

international conference on Functional programming ICFP '96, volume 3i 

Issue 6 
Publisher: ACM Press 

Full text available: 1 Bpdff1 .20 MBV : Addjtional Information: full citation , abstract, references , citings, index 

terms 

This paper describes the distributed memory implementation of a shared memory parallel 
functional language. The language is Id, an implicitly parallel, mostly functional language 
that is currently evolving into a dialect of Haskell. The target is a distributed memory 
machine, because we expect these to be themost widely available parallel platforms in 
the future. The difficult problem is to bridge the gap between the shared memory 
language model and the distributed memory machine model. The Ian ... 

Keywords: data flow, garbage collection and run-time systems, parallel and distributed 
implementations 



3 A caching file system for a programmer's workstation prinouies rmt h • j 
^ Michael D. Schroeder, David K. Gifford, Roger M: Needham - 

^ December 1985 ACM SIGOPS Operating Systems Review , Proceedings of the tenth 
ACM symposium on Operating systems principles SOSP '85, volume 19 
Issue 5 
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A coherent distributed file cache with directory write-behind 
Timothy Mann, Andrew Birrell, Andy Hisgen, Charles Jerian, Garret Swart 
May 1994 ACM Transactions on Computer Systems (TOCS), volume 12 issue 2 
Publisher: ACM Press 

Full text available: ■ pdfQ.21 MB) Additional Information: full citation , abstract, references , dtiogs, index 
^ terms , review 

Extensive caching is a key feature of the Echo distributed file system. Echo client 
machines maintain coherent caches of file and directory data and properties, with write- 
behind (delayed write-back) of all cached information. Echo specifies ordering constraints 
on this write-behind, enabling applications to store and maintain consistent data 
structures in the file system even when crashes or network faults prevent some writes 
from being completed. In this paper we describe ... 

Keywords: coherence, file caching, write-behind 



5 A com parison of message passing and shared memory architectures for data parallel 
programs 

A. C Klaiber, H. M. Levy 

April 1994 ACM SIGARCH Computer Architecture News , Proceedings of the 21ST 

annual international symposium on Computer architecture ISCA '94, volume 
22 Issue 2 

Publisher: IEEE Computer Society Press, ACM Press 

Full text available* ISal pdff 1 26 MB) Additional Information: full citation, abstract , references , citings, index 

terms 

Shared memory and message passing are two opposing communication models for 
parallel multicomputer architectures. Comparing such architectures has been difficult, 
because applications must be hand-crafted for each architecture, often resulting in 
radically different sources for comparison. While it is clear that shared memory machines 
are currently easier to program, in the future, programs will be written in high-level 
languages and compiled to the specific parallel target, thus eliminating thi ... 

6 A comparison of MPI. SHMEM and cache-coherent shared address space 
^ programmin g models on the SGI Origin2000 

^ Hongzhang Shan, Jaswinder Pal Singh 

May 1999 Proceedings of the 13th international conference on Supercomputing ICS 
•99 

Publisher: ACM Press 

Full text available: ^ pdf(2.30 MB) Additional Information: full citation , references , citings , index terms 



7 A comparison of three programming models for adaptive applications on the 
Oriqin2000 

Hongzhang Shan, Jaswinder P. Singh, Leonid Oliker, Rupak Biswas 

November 2000 Proceedings of the 2000 ACM/IEEE conference on Supercomputing 
(CDROM) Supercomputing f OO 

Publisher: IEEE Computer Society ( 

Full text available: ^ pdf(239.30 KB) Additional Information: full citation , abstract , references , citings , index 
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Adaptive applications have computational workloads and communication patterns which 
change unpredictably at runtime, requiring load balancing to achieve scalable 
performance on parallel machines. Efficient parallel implementation of such adaptive 
application is therefore a challenging task. In this paper, we compare the performance of 
and the programming effort required for two major classes of adaptive applications under 
three leading parallel programming models on an SGI Origin 2000 syste ... 

8 A cost-comparison approach for adaptive distributed shared memory 
Jai-Hoon Kim, Nitin H. Vaidya 

January 1996 Proceedings of the 10th international conference on Supercomputing 
ICS '96 

Publisher: ACM Press 

Full text available: l j|S) pdf (976.97 KB) Additional Information: full citation , references , citings , index terms 



A framework for delivering multicast message in networks with mobile hosts 
Arup Acharya, B. R. Badrinath 

October 1996 Mobile Networks and Applications, volume l issue 2 
Publisher: Kluwer Academic Publishers 

Full text available: « pdf(438.33 KB) Additional Information: full citation , abstract, references, citings, index 
m terms 

To accommodate mobile hosts (MHs) within existing data networks, the static network is 
augmented with "mobile support stations" (MSSs) that communicate directly with MHs, 
usually via wireless links. Connectivity of the overall network changes dynamically as MHs 
connect to the static network fromdifferent "locations" (MSSs) at different times. 
Compared to their desktop counterparts, mobile tiosts face a newiset of constraints 
namely, low bandwidth of the wireless links9fL — rGl Ori.-;.i 7000 s .v 

10 A high performance cluster JVM presenting a pure single system image 
Y. Aridor, M. Factor, A. Teperman, T. Eilam, A. Schuster 

June 2000 Proceedings of the ACM 2000 conference on Java Grande JAVA 'OO 

Publisher: ACM Press 

Full text available: ^pdf(916.33 KB) Additional Information: full citation , references , citings , index terms 



11 A high-level abstraction of shared accesses 
Peter j. Keleher 

February 2000 ACM Transactions on Computer Systems (TOCS), volume 18 issue i 
Publisher: ACM Press 

Full text available: 1p| pdf(183 57 KB) Additiona ' Information: full citation , abstract , references , index terms . 

review 

We describe the design and use of the tape mechanism,. a r nevy hjgh4eyel abstraction of 
accesses to shared data for software DSMsJ Japes 'consolidate ancf generalize a number of 
recent protocol optimizations, including update-based locks and recorded-replay barriers. 
Tapes are usually created by "recording" shared accesses. The resulting recordings can be 
used to anticipate future accesses by tailoring data movement to application semantics. 
Tapes-based mechanisms a ... 

Keywords: DSM, programming libraries, shared memory, update protocols 



http://portal.acm.org/ra^ 3/] 2 /07 



Results (page 1): local cache and message and version and copy 



Page 4 of 6 



12 A message driven OR-parallel machine 
S. A. Delgado-Rannauro, T. J. Reynolds 

April 1989 ACM SIGARCH Computer Architecture News , Proceedings of the third 
international conference on Architectural support for programming 
languages and operating systems ASPLOS-III volume 17 issue 2 
Publisher: ACM Press 

Full text available: *^ pdf(1.29 MB) Additional Information: full citation , abstract , references , index terms 

A message driven architecture for the execution of OR-parallel logic languages is 
proposed. The computational model is based on well known compilation techniques for 
Logic Languages. We present first the multiple binding mechanism for the OR-parallel 
Prolog architecture and the corresponding OR-parallel abstract machine is described. A 
scheduling algorithm which does not rely upon the availability of global data structures to 
direct the search for work is discussed. The message driven proce ... 

13 A Performance Evaluation of the Convex SPP-1000 Scalable Shared Memory 
Parallel Computer 

Thomas Sterling, Daniel Savaresse, Peter MacNeice, Kevin Olson, Clark Mobarry, Bruce 
Fryxell, Phillip Merkey 

December 1995 Proceedings of the 1995 ACM/IEEE conference on Supercomputing 
(CDROM) - Volume 00 Supercomputing '95 

Publisher: ACM Press 
Full text available: ^ pdf(457.1 1 KB) 

htmi(2.67 KB) Additional Information: full citation , references , citings 
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14 A q uantitative analysis of cache policies for. scalable network file systems 
Michael D. Dahlin, Clifford J. Mather, Randolph Y. Wang, Thomas E. Anderson, David A. 
Patterson 

May 1994 ACM SIGMETRICS Performance Evaluation Review , Proceedings of the 
1994 ACM SIGMETRICS conference on Measurement and modeling of 
computer systems SIGMETRICS '94> Volume 22 issue 1 . 

Publisher: ACM Press 

Full text available: fS] pdf(1.42 MB) Additional Information: full citation , abstract , references , citings, index 
. i£j h terms 

Current network file system protocols rely heavily on a central server to coordinate file 
activity among client workstations. This central servercan become a bottleneck that limits 
scalability for environments with large numbers of clients. In central server systems such 
as NFS and AFS, all client writes, cache misses, and coherence messages are handled by 
the server. To keep up with this workload, expensive server machines are needed, 
configured with high-performance CPUs, memory systems, ... 

15 A quantitative comparison of parallel computation models 
Ben H. H. Juurlink, Harry A. G. Wijshoff • 

August 1998 ACM Transactions on Computer Systems (TOCS), volume 16 issue 3 
Publisher: ACM Press 

Full text available* I bSI pdf(1 .06 MB) Additional Information: full citation , abstract , references , citings , index 
™ terms 

In recent years, a large number of parallel computation models have been proposed to 
replace the PRAM as the parallel computation model presented to the algorithm designer. 
Although mostly the theoretical justifications for these models are sound, and many 
algorithmic results where obtained through these mo|dels /r nWe has been 

conducted to validate the effectiveness of tKese models /fq^developing cost-effective 
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algorithms and applications on existing hardware platforms. In ... 
Keywords: parallel computation models, performance evaluation 

16 A quorum-consensus replication method for abstract data ty pes ||§j 
Maurice Herlihy 

February 1986 ACM Transactions on Computer Systems (TOCS), volume 4 issue l 
Publisher: ACM Press 

Full text available: f B-pdfd.66 MB) Addltional Information: full citation, abstract , references , citings, index 

terms , review 

Replication can enhance the availability of data in distributed systems. This paper 
introduces a new method for managing replicated data. Unlike many methods that 
support replication only for uninterpreted files, this method systematically exploits type- 
specific properties of objects such as sets, queues, or directories to provide more effective 
replication. Each operation requires the cooperation of a certain number of sites for its 
successful completion. A quorum for an operation is any s ... 

17 A reliable multicast webcast protocol for multimedia collaboration and caching ||§ 
L. Kristin Wright, Steven McCanne, Jay Lepreau 

October 2000 Proceedings of the eighth ACM international conference on Multimedia 
MULTIMEDIA '00 

Publisher: ACM Press 

Full text available: ^ pdf(1.10 MB) Additional Information: full citation , abstract , references , index terms 

Large-scale, multi-point, multimedia conferencing applications designed to facilitate long- 
distance collaboration are enjoying growing popularity. Usually composed of real-time 
audio, video and shared-drawing applications, these collaborative environments help 
render the geographical location of collaborators irrelevant. To complement these existing 
collaborative applications, it would be useful to Jiaye Jfte ability to distribute documents 
synchronously over the World Wide Web (WWW). One model 

18 A scalable Web cache consistency architecture j 
Haobo Yu, Lee Breslau, Scott Shenker _ t . . . 
August 1999 ACM SIGCOMM Computer Communication Review /Proceedings of the 

conference on Applications, technologies, architectures, and protocols 
for computer communication SIGCOMM '99, volume 29 issue 4 
Publisher: ACM Press 

Full text available: H Bpdfd. 71 MB) Additional Information^ full citation , abstract / references , citings , index 
^ < terms 

The rapid increase in web usage has led to dramatically increased loads on the network 
infrastructure and on individual web servers. To ameliorate these mounting burdens, 
there has been much recent interest jn web caching .architectures Web 
caching reduces network load, server load/ and the latency of responses. However, web 
caching has the disadvantage that the pages returned to clients by caches may be stale, 
in that they may not be consistent with the version currently o ... 

19 A structural view of the Cedar programming environment - %t ' m ^ ~ r • §|j 
Daniel C. Swinehart, Polle T. Zellweger, Richard J. Beach, Robert B. Hagmann 
August 1986 ACM Transactions on Programming Languages and Systems (TOPLAS), 

Volume 8 Issue 4 

Publisher: ACM Press ^ r,, . c 

Full text available: fQ pdf(6.32 MB) Additional Information: full citation , abstract , references , citings , index 
^ terms 

This paper presents an overview of the Cedar programming environment, focusing on its 
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overall structure— that is, the major components of Cedar and the way they are 
organized. Cedar supports the development of programs written in a single programming 
language, also called Cedar. Its primary purpose is to increase the productivity of 
programmers whose activities include experimental programming and the development of 
prototype software systems for a high-performance personal computer. T ... 

20 A survey of commercial parallel processors 

Edward Gehringer, Janne Abullarade, Michael H. Gulyn 

September 1988 ACM SIGARCH Computer Architecture News, volume 16 issue 4 
Publisher: ACM Press 

Full text available: ^ pdf(2.96 MB) Additional Information: full citation , abstract , citings , index terms 

This paper compares eight commercial parallel processors along several dimensions. The 
processors include four shared-bus multiprocessors (the Encore Multimax, the Sequent 
Balance system, the Alliant FX series, and the ELXSI System 6400) and four network 
multiprocessors (the BBN Butterfly, the NCUBE, the Intel iPSC/2, and the FPS T Series). 
The paper contrasts the computers from the standpoint of interconnection structures, 
memory configurations, and interprocessor communication; Also, the share ... 
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Efficient cooperative caching in ad hoc networks. 
Conference information 

2006 1st International Conference on Communication System Software and Middleware, New Delhi, 

India, 8-12 Jan. 2006. 
Source 

2006 1st International Conference on Communication System Software and Middleware (IEEE Cat. No. 

06EX1230C), 2006, p. 8 pp., 26 refs, pp. CD-ROM, ISBN: 0-7803r9574-3. 

Publisher: IEEE, Piscataway, NJ, USA. 
Author(s) 

Chand-N , Joshi-R-C f Misra-M . 
Author affiliation 

Chand, N., Joshi, R.C., Misra, M., Dept. of Electron. * Comput.Eng., Indian Inst, of Technol., Roorkee, 
India. . > 

Abstract 

Caching of frequently accessed data in multi-hop ad hoc environment is a potential technique that can 
improve the data access performance and availability. Cooperative caching, which allows the sharing 
and coordination of cached data among clients, can further explore the potential of the caching 
techniques. In this paper, we propose a novel scheme, called zone cooperative (ZC) for caching in 
mobile ad hoc networks. In ZC scheme, one-hop neighbors of a mobile client form a cooperative cache 
zone since the cost for communication with them is low both in terms of energy consumption and 
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message exchange. As a part of cache management, cache admission control and VALUE based 
replacement policy are developed to improve the data accessibility and reduce the local cache miss 
ratio. An analytical study of ZC based on data popularity, node density and transmission range is also 
performed. Simulation experiments show that the ZC caching mechanism achieves significant 
improvements in cache hit ratio and average query latency in comparison with other caching 
strategies. 
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Abstract 

Caching of frequently accessed data in ad hoc networks is a potential technique that can improve the 
data access performance and availability. Cooperative caching, which allows the sharing and 
coordination of cached data among clients, can further explore the potential of the caching techniques. 
In this paper, we propose a novel scheme, called zone cooperative (ZC) for caching in mobile ad hoc 
networks. In ZC scheme, one-hop neighbors of a mobile client form a cooperative cache zone since 
the cost for communication with them is low both in terms of energy consumption and message 
exchange. For a data miss in the local cache, each client first searches the data in its zone before 
forwarding the request to the next client that lies along routing path towards server. As a part of 
cache management, cache admission control and replacement policy are developed to improve the 
data accessibility and reduce the local cache miss ratio. Simulation experiments show that the ZC 
caching mechanism achieves significant improvements in terms of cache hit ratio and average query 
latency in comparison with other caching strategies. 
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In open multi-agent systems, agents need resources provided by other agents but they are not aware 
of which agents provide particular resources. We consider a peer-to-peer approach, in which each 
agent maintains a local cache with information about k resources, that is for each of the k resources, 
an agent that provides it. However, when an agent or a resource moves, cache entries become 
obsolete. We propose a suite of cache update policies that combine pull-based invalidation that is 
initiated by the agent that maintains the cache with push-based invalidation that is initiated by the 
agent that moves. We study and compare variations of oblivious flooding-based push/pull along with 
an informed push approach where each agent maintains a list of the agents that have it cached. Our 
experimental results indicate that a novel variation of flooding for push where a moving agent 
propagates its new location to agents in its old neighborhood achieves good cache consistency with a 
small message overhead. The proposed policies are suitable for any peer-to-peer system where peers 
cache information about other peers and this information becomes obsolete. 
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2003. 

Sponsor(s): IEEE Comput. SocTech. Committee on Parallel Process; IEEE Comput. Soc. Tech. 
Committee on Comput. Archit; IEEE Comput. Soc. Tech. Committee on Distrib. Process; ACM 
SIGARCH. 
Source 

Proceedings International Parallel and Distributed Processing Symposium, 2003, p. 8 pp., 13 refs, pp. 

CD-ROM, ISBN: 0-7695-1926-1. 

Publisher: IEEE Comput. Soc, Los Alamitos, CA, USA. 
Author(s) 

Dimakopoulos-V-V , Pitoura-E . 
Author affiliation 

Dimakopoulos, V.V., Pitoura, E., Dept. of Comput.. Sci., Ioannina Univ., Greece. 
Abstract 

In open multi-agent systems agents need resources provided by other agents but they are not aware 
of which agents provide the particular resources. Most solutions to this problem are based on a central 
directory that maintains a mapping between agents and resources. However, such solutions do not 
scale well since the central directory becomes a bottleneck in terms of both performance and reliability. 
In this paper, we introduce a different approach: each agent maintains a limited size local cache in 
which it keeps information about k different resources, that is, for each of k resources, it stores the 
contact information of one agent that provides it. This creates a directed network of caches. We 
address the following fundamental problem: how can an agent that needs a particular resource find an 
agent that provides it by navigating through this network of caches? We propose and analytically 
compare the performance of three different algorithms for this problem, flooding, teeming and random 
paths, in terms of three performance measures: the probability to locate the resource, the number of 
steps and the number of messages to do so. Our analysis is also applicable to distributed search in 
unstructured peer-to-peer networks. 
Descriptors 
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The costs of using JXTA. 
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Proceedings Third International Conference on Peer-to-Peer Computing (P2P2003), Linkoping, Sweden, 
1-3 Sept. 2003. 

Sponsor(s): IEEE Sweden; Linkopings Univ; IISLAB (Intelligent Inf. Syst. Lab.). 
Source 

Proceedings Third International Conference on Peer-to-Peer Computing (P2P2003), 2003, p. 160-7, 15 
refs, pp. x+222, ISBN: 0-7695-2023-5. 
Publisher: IEEE Comput. Soc, Los Alamitos, CA, USA. 
Author(s) 

Halepovic-E , Deters- R . 

Editor(s): Shahmehri-N f Graham-R-L , Carroni-G . 
Author affiliation 

Halepovic, E., Deters, R., Dept. of Comput. ScL, Saskatchewan Univ., Saskatoon, Sask., Canada. 
Abstract 

Project JXTA is an open-source effort to specify the standard protocols for peer-to-peer communication 
and collaboration. We propose a JXTA performance model and present results obtained by 
benchmarking the JXTA 1.0 reference implementation in Java. We focus on the performance evaluation 
of typical peer operations and consequences for the peer network, the user and the developer. The 
important trade-off between peer startup latency and the maintenance of the local cache is shown 
and discussed. The throughput limits of pipes, the core JXTA communication concept, are also 
measured in a LAN environment for smooth and bursty traffic. The results indicate that the limiting 
factor for reliable throughput is the number of messages rather than size in bytes, as well as that 
small JXTA messages carry an excessive overhead of control data. Important performance issues and 
trade-offs are identified and explored, as a basis for the formulation of guidelines for system designers 
and simulation-based research of JXTA networks. 
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performance-model; Java; peer-startup-latency; local-cache; LAN-environment; JXTA-message; 
simulation-based-research. 
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Proceedings IEEE International Conference on Cluster Computing. CLUSTER 2000, 2000, p. 27-34, 11 
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Arantes-L f Sens-P , Folliot-B . 
Author affiliation 

Arantes, L., Sens, P., Folliot, B., LIP6 Lab., Paris VI Univ., France. 
Abstract 

As interconnected local-area workstation networks are widely available, the idea of offering a software 
distributed shared memory (SDSM) layer across them is quite an attractive alternative for compute- 
intensive applications. However, the higher cost of sending a message over an inter-cluster link than 
over an intracluster can limit applications' performance on a multicluster SDSM system. In this paper, 
we present the extensions that we have added to TreadMarks SDSM in order to adapt it to a loosely- 
coupled cluster-based platform. We have implemented a logical per-cluster cache in order to exploit 
cluster locality. By accessing its local cache, a processor can share data previously requested by 
another processor of its cluster, thereby hiding the cost of inter-cluster communication. 
Descriptors 
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Shared memory versus message passing for iterative solution of sparse, irregular problems. 
Source 

Parallel Processing Letters, {Parallel-Process-Lett-Singapore}, March 1999 # vol. 9, no. 1, p. 159-70, 15 
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Publisher: World Scientific, Singapore. 
Author(s) 

Chong-F-T , Aqarwal-A . 
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Chong, FT., Dept. of Comput. Sci., California Univ., Davis, CA, USA. 
Abstract 

The benefits of hardware support for shared memory versus those for message passing are difficult to 
evaluate without an in-depth study of real applications on a common platform. We evaluate the 
communication mechanisms of the MIT Alewife machine, a multiprocessor which provides integrated 
cache-coherent shared memory, message passing, and DMA. We perform this evaluation with "best- 
effort' implementations which solve several sparse, irregular benchmark problems with a 
preconditioned conjugate gradient sparse matrix solver (ICCG). We find that machines with fast global 
memory operations do not need message passing or bulk transfer to support our irregular problems. 
This is primarily due to three reasons. First, a 5-to-l ratio between global and local cache misses 
makes memory, copies in bulk communication expensive relative to communication via shared memory. 
Second, although message passing has synchronization semantics superior to shared memory for 
data-driven computation, efficient shared memory can overcome this handicap by using global read- 
modify-writes to change from the traditional owner-computes model to a producer-computes model. 
Third, bulk transfers can result in high processor idle times in irregular applications. 
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Publisher: Springer-Verlag, Germany. 
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Keller-A-M P Basu-J . 
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Keller, A.M., Dept. of Comput. Sci., Stanford Univ., CA. 
Abstract 

We propose a new client-side data-caching scheme for relational databases with a central server and 
multiple clients. Data are loaded into each client cache based on queries executed on the central 
database at the server. These queries are used to form predicates that describe the cache contents. A 
subsequent query at the client may be satisfied in its local cache if we can determine that the query 
result is entirely contained in the cache. This issue is called cache completeness. A separate issue, 
cache currency, deals with the effect on client caches of updates committed at the central database. 
We examine the various performance tradeoffs and optimization issues involved in addressing the 
questions of cache currency and completeness using predicate descriptions and suggest solutions that 
promote good dynamic behavior. Lower query-response times, reduced message traffic, higher server 
throughput, and better scalability are some of the expected benefits of our approach over commonly 
used relational server-side and object ID-based or page-based client-side caching. 
Descriptors 
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Proceedings 22nd Annual International Symposium on Computer Architecture (IEEE Cat. 

NO.95CB35801), 1995, p. 48-59, 40 refs, pp. xiii+426, ISBN: 0-89791-698-0. 
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Lebeck-A-R , Wood-D-A . 
Author affiliation 

Lebeck, A.R., Wood, D.A., Dept. of Comput. Sci., Wisconsin Univ., Madison, WI, USA. 
Abstract 

The paper introduces dynamic self-invalidation (DSI), a new technique for reducing cache coherence 
overhead in shared-memory multiprocessors. DSI eliminates invalidation messages by having a 
processor automatically invalidate its local copy of a cache block before a conflicting access by 
another processor. Eliminating invalidation overhead is particularly important under sequential 
consistency: where the latency of invalidating outstanding copies can increase a program's critical 
path. DSI is applicable. to software, hardware, and hybrid coherence schemes. We evaluate DSI in the 
context of hardware directory-based write-invalidate coherence protocols. Our results show that DSI 
reduces execution time of a sequentially consistent full-map coherence protocol by as much as 41%. 
This is comparable to an implementation of weak consistency that uses a coalescing write-buffer to 
allow up to 16 outstanding requests for exclusive blocks. When used in conjunction with weak 
consistency DSI can exploit tear-off blocks-which eliminate both invalidation and acknowledgment 
messages-for a total reduction in messages of'up to 26%. 
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Singh-J-P r Hennessy-J-L f Gupta-A . 
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Singh, J. P., Hennessy, J.L., Gupta, A., Comput Syst. Lab., Stanford Univ., CA, USA. 
Abstract 

To design effective large scale multiprocessors, designers need to understand the characteristics of the 
' applications that will use the machines. One important class of applications is based on hierarchical N 
body methods. The article examines the key architectural implications of representative applications 
that use the two dominant hierarchical N body methods: the Barnes-Hut Method (3.E. Barnes, P. Hut, 
1989) and the Fast Multipole Method. We first show that exploiting temporal locality on accesses to 
communicated data is critical to obtaining good performance on these applications and then argue that 
coherent caches on shared address space machines exploit this locality both automatically and very 
effectively. Next, we examine the implications of scaling the applications to run on larger machines. We 
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use scaling methods that reflect the concerns of the application scientist and And that this leads to 
different conclusions about how communication traffic and local cache and memory usage scale than 
scaling based only on data set size. In particular, we show that under the most realistic form of scaling, 
both the communication to computation ratio as well as the working set size grow slowly as larger 
problems are run on larger machines. Finally, we examine the effects of using the two dominant 
abstractions for interprocessor communication: a shared address space and explicit message passing 
between private address spaces. 
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Keller, A.M., Dept. of Comput. ScL, Stanford Univ., Palo Alto, CA, USA. 
Abstract 

We propose a new client-side data caching scheme for relational databases with a central server and 
multiple clients. Data is loaded into a client cache based on queries, which are used to form predicates 
describing the cache contents. A subsequent query at the client may be satisfied in its local cache if 
we can determine that the query result is entirely contained in the cache. This issue is called cache 
completeness'. On the other hand, cache currency deals with the effect of updates at the central 
database on the client caches. We examine various performance and optimization issues involved in 
addressing the questions of cache currency and completeness using predicate descriptions. Expected 
benefits of our approach over commonly used object ID-based caching include lower query response 
times, reduced message traffic, higher server throughput, and better scalability. 
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Abstract 

The method steps include (a) locally caching documents and updating local (ogs and directories, (b) 
responsive to updating of the local cache documents through a counterpart terminal, comparing the 
form and content identities of the changed document with designated other cache directories, and (c) 
upon a match entering only the updates, upon a complete mismatch entering the entire document and 
updates, and upon a forms identity match entering only the content updates. 
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